Taxonomy-based Adaptive Web Search Method

Authors

  • Said Mirza Pahlevi
  • Hiroyuki Kitagawa
Abstract

Current crawler-based search engines usually return a long list of results that contains many irrelevant (noise) documents. Taxonomy-based search engines improve result quality by indexing the collected documents under topic paths in a taxonomy, but their searches are limited to locally compiled databases. In this paper, we propose an adaptive web search method that improves result quality while still letting users search the many databases available in the web space. The method combines taxonomy-based search engines with a machine learning technique. More specifically, the user selects a context category in the taxonomy of a taxonomy-based search engine; we construct a rule-based classifier from the pre-classified documents that the engine provides for that category, and then use the classifier to modify the user query. The modified query is sent to crawler-based search engines, and the returned results are presented to the user. We evaluate the effectiveness of the method by showing that the results returned for the modified query consist almost entirely of documents that would be categorized into the selected context category.
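The query-modification step described in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: the abstract does not specify the rule learner or the query syntax, so this sketch assumes a toy learner that keeps single terms with high precision for the selected category, and a Boolean `AND`/`OR` query format. The function names `learn_rule_terms` and `modify_query`, the thresholds, and the sample documents are all hypothetical.

```python
from collections import Counter


def learn_rule_terms(positive_docs, negative_docs, min_precision=0.8, min_support=2):
    """Toy one-term rule learner (an assumption, not the paper's classifier).

    A term becomes a rule "term present => context category" when it occurs in
    at least `min_support` positive documents and its precision over the
    labelled documents is at least `min_precision`.
    """
    # Document frequency of each term among in-category and out-of-category documents.
    pos_df = Counter(t for d in positive_docs for t in set(d.lower().split()))
    neg_df = Counter(t for d in negative_docs for t in set(d.lower().split()))

    rules = []
    for term, pos in pos_df.items():
        precision = pos / (pos + neg_df[term])
        if pos >= min_support and precision >= min_precision:
            rules.append((term, precision, pos))

    # Most precise rules first, then most frequent, then alphabetical for stability.
    rules.sort(key=lambda r: (-r[1], -r[2], r[0]))
    return [term for term, _, _ in rules]


def modify_query(query, rule_terms, max_terms=3):
    """Restrict the user query with a disjunction of rule terms (hypothetical syntax)."""
    query_words = set(query.lower().split())
    extra = [t for t in rule_terms if t not in query_words][:max_terms]
    if not extra:
        return query
    return f"{query} AND ({' OR '.join(extra)})"


# Hypothetical pre-classified documents for a selected "automobiles" category.
in_category = [
    "jaguar car engine review",
    "jaguar car dealer price",
    "car engine repair",
]
out_of_category = [
    "jaguar cat habitat jungle",
    "jaguar animal habitat",
]

rule_terms = learn_rule_terms(in_category, out_of_category)
print(modify_query("jaguar", rule_terms))
```

In this sketch the ambiguous query `jaguar` is narrowed with terms that the labelled documents associate with the chosen category; the resulting string is what would be forwarded to a crawler-based engine.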

Similar resources

A Taxonomy-based Focused Retrieval Method for the Web Space

The problem of word ambiguity is fundamental to information retrieval in the web space. It originates from the very short queries that are common in web information retrieval [1]. One way to deal with this issue is to provide a taxonomy to users so that they can use it to express their query intent to the system. This approach is taken by existing taxonomy (directory)-...

SAGE Agent for the SATELIT Web-based system

This article presents SAGE, an adaptive interface agent for the Web-based SATELIT system. This system is dedicated to developing and browsing Web-based catalogues in the natural sciences that work with taxonomies. The SAGE learning agent was developed as part of the SATELIT adaptive interface and deals with the general hypermedia problem of « getting lost in hypermedia space ». SAGE...

Automatic discovery of synonyms and lexicalizations from the Web

Searching for Web resources is a very important topic because of the huge amount of valuable information available on the WWW. Standard search engines can be a great help, but they are often based only on the presence or absence of keywords, so problems of semantic ambiguity appear. In order to solve one of them, we propose a new method for discovering lexicalizations and synonyms of search...

TaxoGen: Constructing Topical Concept Taxonomy by Adaptive Term Embedding and Clustering

Taxonomy construction is not only a fundamental task for semantic analysis of text corpora, but also an important step for applications such as information filtering, recommendation, and Web search. Existing pattern-based methods extract hypernym-hyponym term pairs and then organize these pairs into a taxonomy. However, by considering each term as an independent concept node, they overlook the ...

A New Hybrid Method for Web Pages Ranking in Search Engines

There are many algorithms for optimizing search engine results. Ranking takes place according to one or more parameters, such as backward links, forward links, content, and click-through rate, and the quality and performance of these algorithms depend on those parameters. Ranking is one of the most important components of a search engine and represents the degree of the vitality...

Publication date: 2002